IGG3: a tool to rapidly integrate large genotype datasets for whole-genome imputation and individual-level meta-analysis

Authors

  • Miao-Xin Li
  • Lin Jiang
  • Patrick Yu-Ping Kao
  • Pak Chung Sham
  • You-Qiang Song
Abstract

SUMMARY: There is an urgent and increasing demand for integrating large genotype datasets across genome-wide association studies and the HapMap project for whole-genome imputation and individual-level meta-analysis. A new algorithm was developed to efficiently merge raw genotypes across large datasets and was implemented in the latest version of IGG, IGG3. In addition, IGG3 can integrate the latest phased and unphased HapMap genotypes and can flexibly generate complete sets of input files for six popular genotype imputation tools. Simulation tests demonstrated the efficiency of IGG3: on an ordinary desktop computer, it rapidly merged genotypes in tens of thousands of large genotype chips (e.g. Affymetrix Genome-Wide Human SNP Array 6.0 and Illumina Human1m-duo) and in the HapMap III project. AVAILABILITY: http://bioinfo.hku.hk/iggweb (version 3.0).
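
To make the merging task concrete, the following is a minimal Python sketch of combining individuals genotyped on different chips into one matrix over the union of SNPs. It is not the IGG3 algorithm, which the abstract does not describe; the function name `merge_genotypes`, the dictionary layout, and the `"00"` missing-genotype code are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

# Assumed placeholder for a genotype that a chip did not type (hypothetical).
MISSING = "00"

# Assumed layout: individual id -> {SNP id -> genotype string such as "AG"}.
Dataset = Dict[str, Dict[str, str]]


def merge_genotypes(datasets: List[Dataset]) -> Tuple[List[str], Dict[str, List[str]]]:
    """Merge datasets over the union of their SNPs.

    Returns the sorted SNP ids and, per individual, the genotypes in that
    SNP order. SNPs absent from an individual's chip are set to MISSING,
    which is the kind of gap an imputation tool would later fill.
    """
    # Union of all SNP ids, sorted so the column order is reproducible.
    snps = sorted({snp for ds in datasets for geno in ds.values() for snp in geno})

    merged: Dict[str, List[str]] = {}
    for ds in datasets:
        for individual, geno in ds.items():
            if individual in merged:
                raise ValueError(f"duplicate individual id: {individual}")
            merged[individual] = [geno.get(snp, MISSING) for snp in snps]
    return snps, merged


if __name__ == "__main__":
    chip_a = {"ind1": {"rs1": "AA", "rs2": "AG"}}
    chip_b = {"ind2": {"rs2": "GG", "rs3": "CT"}}
    snps, merged = merge_genotypes([chip_a, chip_b])
    print(snps)
    print(merged)
```

A real merge would also need to reconcile strand orientation, allele coding, and genome build between chips, which this sketch leaves out.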

Similar resources

Imputation of parent-offspring trios and their effect on accuracy of genomic prediction using Bayesian method

The objective of this study was to evaluate the imputation accuracy of parent-offspring trios under different scenarios. Using simulated datasets, the performance of Bayesian LASSO in genomic prediction was also examined. The genome consisted of 5 chromosomes, each 1 Morgan in length. The number of SNPs per chromosome was 10000. One hundred QTLs were randomly distributed a...

The Impact of Imputation on Meta-Analysis of Genome-Wide Association Studies

Genotype imputation is often used in the meta-analysis of genome-wide association studies (GWAS), for combining data from different studies and/or genotyping platforms, in order to improve the ability for detecting disease variants with small to moderate effects. However, how genotype imputation affects the performance of the meta-analysis of GWAS is largely unknown. In this study, we investiga...

The Pattern of Linkage Disequilibrium in Livestock Genome

Linkage disequilibrium (LD) is the basis of genomic selection, genomic marker imputation, marker-assisted selection (MAS), quantitative trait loci (QTL) mapping, parentage testing and whole-genome association studies. Particular alleles at closely spaced loci tend to be co-inherited. In linked loci this pattern leads to an association between alleles in the population, which is known as LD. Two metr...

Genotype imputation.

Genotype imputation is now an essential tool in the analysis of genome-wide association scans. This technique allows geneticists to accurately evaluate the evidence for association at genetic markers that are not directly genotyped. Genotype imputation is particularly useful for combining results across studies that rely on different genotyping platforms but also increases the power of individu...

Evaluating Imputation Algorithms for Low-Depth Genotyping-By-Sequencing (GBS) Data

Well-powered genomic studies require genome-wide marker coverage across many individuals. For non-model species with few genomic resources, high-throughput sequencing (HTS) methods, such as Genotyping-By-Sequencing (GBS), offer an inexpensive alternative to array-based genotyping. Although affordable, datasets derived from HTS methods suffer from sequencing error, alignment errors, and missing ...

Journal title:
  • Bioinformatics

Volume 25, Issue 11

Pages -

Publication date 2009